The limits and robustness of reinforcement learning in Lewis signalling games
نویسندگان
چکیده
Lewis signaling games are a standard model to study the emergence of language. We introduce win-stay/lose-inaction, a random process that only updates behavior on success and never deviates from what was once successful, prove that it always ends up in a state of optimal communication in all Lewis signaling games, and predict the number of interactions it needs to do so: N3 interactions for Lewis signaling games with N equiprobable types. We show three reinforcement learning algorithms (Roth-Erev learning, Q-learning, and Learning Automata) that can imitate win-stay/lose-inaction and can even cope with errors in Lewis signaling games.
منابع مشابه
Hybrid learning in signalling games
Lewis-Skyrms signaling games (Lewis 1969; Skyrms 2010) have been studied under a variety of low-rationality learning dynamics (Barrett 2006; Barrett and Zollman 2009; Huttegger, Skyrms, Smead, and Zollman 2010; Huttegger, Skyrms, Tarrès, and Wagner 2014; Huttegger, Skyrms, and Zollman 2014). Reinforcement dynamics are stable but slow and prone to evolving suboptimal signaling conventions. A low...
متن کاملThe Limits of Reinforcement Learning in Lewis Signaling Games
We study how communication systems can emerge through repeated interaction between two individuals. We apply three reinforcement learning algorithms (Roth-Erev learning, Learning Automata, and Q-Learning) to the repeated Lewis signaling game, a game theoretic model of a communication problem. Our experiments show that each of these algorithms always reach a state of optimal communication even f...
متن کاملThe Evolution of Coding in Signaling Games
Signaling games with reinforcement learning have been used to model the evolution of term languages (Lewis 1969, Convention. Cambridge, MA: Harvard University Press; Skyrms 2006, “Signals” Presidential Address. Philosophy of Science Association for PSA). In this article, syntactic games, extensions of David Lewis’s original sender– receiver game, are used to illustrate how a language that explo...
متن کاملUtilizing Generalized Learning Automata for Finding Optimal Policies in MMDPs
Multi agent Markov decision processes (MMDPs), as the generalization of Markov decision processes to the multi agent case, have long been used for modeling multi agent system and are used as a suitable framework for Multi agent Reinforcement Learning. In this paper, a generalized learning automata based algorithm for finding optimal policies in MMDP is proposed. In the proposed algorithm, MMDP ...
متن کاملThe Impact of Playing Word Games on Young Iranian EFL Learners’ Vocabulary Learning and Retention
Acquiring adequate vocabulary in a foreign language is very important but often difficult. Considering the importance of learner’s vocabulary learning and retention, the present study aimed at examining the impact of playing word games on young Iranian EFL learners’ vocabulary learning and retention at Irandoostan language Institute in Tabriz. To that end, 50 female learners at the age range of...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Connect. Sci.
دوره 26 شماره
صفحات -
تاریخ انتشار 2014